ON  THE  FITTING  OF  PEARSON  CURVES  TO  SUMS  OF 
INDEPENDENT  RANDOM  VARIABLES 

BY 

THOMAS  SELLKE 


TECHNICAL  REPORT  NO.  333 
MAY  19,  1983 


Prepared  Under  Contract 
N00014-76-C-0475  (NR-042-267) 

For  the  Office  of  Naval  Research 


Reproduction  In  Whole  or  In  Part  Is  Permitted 
for  any  purpose  of  the  United  States  Government. 

Approved  for  public  release;  distribution  unlimited. 


DEPARTMENT  OF  STATISTICS 
Stanford  University 
Stanford,  California 




ON  THE  FITTING  OF  PEARSON  CURVES  TO  SUMS  OF 
INDEPENDENT  RANDOM  VARIABLES 
By 

Thomas  Sellke 

Introduction  and  Summary. 


In  this  report,  we  attempt  to  answer  the  following  questions. 

1.  Is  the  sum  of  independent  beta  (Pearson  Type  I)  random 
variables  distributed  as  a  beta  random  variable? 

2.  How  well  is  the  distribution  of  a  sum  of  independent  betas 
approximated  by  a  beta  distribution? 

3.  If  two  or  more  independent  random  variables  are  best  fitted 
by  one  type  of  Pearson  curve,  is  their  sum  best  fitted  by  a  Pearson 
curve  of  the  same  type? 

Section  1  of  this  paper  shows  that  the  answer  to  the  first  question 
is  "no".  However,  the  calculations  and  computer  simulations  described 
in  section  2  show  that  the  sum  of  independent  beta  random  variables  often 
has  a  distribution  which  is  close  to  a  beta  distribution,  so  that  the 
answer  to  the  second  question  is  often  "very  well". 

Section  3  shows  that  the  answer  to  the  third  question  depends  on 
the  Pearson  curve  type  of  the  random  variables  and  on  whether  they  are 
identically  distributed.  Theorem  1  of  this  section  shows  that  the  sum 
of  independent,  identically  distributed  random  variables  of  Pearson  Type 
I,  II,  III  or  VII  is  best  fitted  by  a  Pearson  curve  of  the  same  type. 

This  is  "almost"  true  for  the  other  Pearson  types  in  a  certain  sense. 

When  the  independent  random  variables  to  be  added  are  not  identically 
distributed,  Pearson  curve  type  is  not  preserved  to  this  extent. 


However,  Theorem  2  and  Theorem  3  can  be  used  to  determine  the  possible 
Pearson  curve  types  of  the  sum  given  the  Pearson  curve  type  of  the 
summands.  There  is  some  interest  in  the  question  of  whether  the  sum  of 
independent  chi  random  variables  is  best  fitted  by  a  Pearson  Type  I 
distribution.  (A  single  chi  random  variable  is  best  fitted  by  a  Pearson 
Type  I.)  The  report  finishes  by  showing  that  Pearson  curves  of  Types  I, 
III,  IV,  V,  and  VI  can  be  best  fitting  for  a  sum  of  two  independent  chi 
random  variables. 

1.  Is  the  Sum  of  Independent  Beta  (Pearson  Type  I)  Random  Variables 
Distributed  as  a  Beta  Random  Variable? 

It is easy to exhibit counterexamples, such as a sum of two independent U[0,1] random variables. More generally, consider m independent betas with intervals of support $[0,a_1], [0,a_2], \ldots, [0,a_m]$. It seems to be the case that the density of the sum of these betas will not be infinitely differentiable at points which can be written as the sum of some subset of the $a_i$'s. Since the density of a beta is infinitely differentiable in the interior of its interval of support, this would imply that a sum of independent betas never has a beta distribution. A rigorous proof of this claim has not been worked out, however.

2.  How  Well  is  the  Distribution  of  a  Sum  of  Independent  Betas 
Approximated  by  a  Beta  Distribution? 


Percentage points were found for the Pearson curves whose first four moments agreed with the first four moments of various test distributions. These values are compared with the true percentage points or with percentage points obtained from computer simulation. The results are found in Tables 1-4. All the Pearson curves used were beta distributions.


Let $U_1$, $U_2$, and $U_3$ be independent U[0,1] random variables. Let $B_{2,2}$ and $B'_{2,2}$ be Beta(2,2) random variables independent of each other and of the $U_i$'s. Table 1 gives percentage points, the Pearson curve approximations to these percentage points, and the true percentile values corresponding to the Pearson curve values for four symmetric test distributions. Note that the Pearson curve approximations do worst for $U_1 + U_2$, whose tent-shaped density does not look much like any beta density. The Pearson curves do about equally well for the other three test distributions.
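The kurtosis values reported for the four test distributions can be checked from cumulants alone, since the cumulants of independent summands add. The following is a minimal sketch, not the report's own computation; the names `UNIFORM`, `BETA22`, and `kurtosis_of_sum` are introduced here for illustration, and the cumulant values are the standard ones for U[0,1] and Beta(2,2).

```python
from fractions import Fraction

# (variance, fourth cumulant) of the two building blocks.
UNIFORM = (Fraction(1, 12), Fraction(-1, 120))   # U[0,1]
BETA22 = (Fraction(1, 20), Fraction(-3, 1400))   # Beta(2,2)


def kurtosis_of_sum(*summands):
    """Kurtosis 3 + k4 / k2**2 of a sum of independent summands.

    Each summand is a (variance, fourth cumulant) pair; cumulants of
    independent random variables add.
    """
    k2 = sum(s[0] for s in summands)
    k4 = sum(s[1] for s in summands)
    return float(3 + k4 / k2**2)


kurtosis = {
    "U1+U2": kurtosis_of_sum(UNIFORM, UNIFORM),
    "U1+U2+U3": kurtosis_of_sum(UNIFORM, UNIFORM, UNIFORM),
    "U1+B22": kurtosis_of_sum(UNIFORM, BETA22),
    "B22+B22'": kurtosis_of_sum(BETA22, BETA22),
}
```

Under these assumptions the four values agree with the kurtosis row of Table 1 to the four decimals printed there.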

Table 2 gives true, computer simulation, and Pearson curve percentage points for a sum of two $\beta(1,3)$ random variables. The computer simulation values were obtained by generating two independent random numbers uniformly distributed on [0,1], doing a transformation to obtain independent random numbers with a $\beta(1,3)$ distribution, recording the sum, and iterating this procedure 10^[exponent illegible] times. The other computer simulations were done in the same way, except that 5 and 10 independent $\beta(1,3)$ random numbers were added in each of the 10^[exponent illegible] iterations. The table shows that the computer simulation percentage points are in very good agreement with the true percentage points. The Pearson curve values are not as good, especially in the lower tail.
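The simulation procedure described above can be sketched as follows. This is an illustration under stated assumptions rather than the program actually used: the report does not say which transformation was applied, so the inverse-CDF transform is assumed here (the $\beta(1,3)$ distribution function is $F(x) = 1 - (1-x)^3$ on [0,1]); the replication count of 20000 and all function names are hypothetical.

```python
import random


def beta13_from_uniform(u):
    """Map a U[0,1] number to a beta(1,3) number by inverting F(x) = 1 - (1 - x)**3."""
    return 1.0 - (1.0 - u) ** (1.0 / 3.0)


def simulate_sums(n_terms, reps, seed=0):
    """Return `reps` simulated sums of `n_terms` iid beta(1,3) numbers, sorted."""
    rng = random.Random(seed)
    return sorted(
        sum(beta13_from_uniform(rng.random()) for _ in range(n_terms))
        for _ in range(reps)
    )


def percentage_point(sorted_sums, pct):
    """Empirical pct-percent point, using a simple order-statistic rule."""
    idx = min(len(sorted_sums) - 1, int(pct / 100.0 * len(sorted_sums)))
    return sorted_sums[idx]


# Hypothetical run: sums of two beta(1,3) numbers, 20000 replications.
sums = simulate_sums(n_terms=2, reps=20000)
median = percentage_point(sums, 50.0)
sample_mean = sum(sums) / len(sums)
```

Calling `simulate_sums` with `n_terms=5` or `n_terms=10` gives the analogous samples for the sums of 5 and 10 summands.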

Tables 3 and 4 give computer simulation and Pearson curve percentage points for sums of 5 and 10 i.i.d. $\beta(1,3)$ random variables, respectively. The true percentage points were not found because the calculations would have been too messy, but Table 2 shows that the computer simulation values should be quite close to the true ones. Table 4 also includes percentage points obtained from the Edgeworth expansion with Edgeworth correction terms of orders $n^{-1/2}$, $n^{-1}$, and $n^{-3/2}$. The different methods show very good agreement in both tables.

These results give an indication of how well the distribution of a sum of i.i.d. $\beta(p,q)$ random variables is approximated by a beta distribution when p and q are small positive integers. The Pearson curve approximation for a sum of two such betas gives only rough agreement with the true percentage points. One explanation of this behavior is that the density for a sum of two such betas exhibits a lack of "smoothness" at 1. For example, the "tent-function" density of $U_1 + U_2$ does not have a first derivative at 1, while the sum of two $\beta(1,3)$ random variables does not have a third derivative at 1. Thus, it is not surprising that such a density is not well approximated by a beta density, which is necessarily infinitely differentiable in its interval of support. As the number of iid betas which are added together increases, the smoothing effect of convolution on the density and the approach of the distribution toward the normal distribution make the approximation by a beta better. Changing from integer values for p and q to real numbers of similar size should not seriously affect the quality of the approximations.

Moderate deviations from the identically distributed case should not make much difference either, although the next section will show that the Pearson curve which best fits a sum of independent, non-identically distributed betas is not always itself a beta. If p and q are both very small positive numbers, it could be necessary to add a large number of these betas together before the sum distribution is smooth enough to be close to a beta. To take an extreme example, consider p = q = 10^[exponent illegible]. Such a $\beta(p,q)$ puts almost all of its mass very close to 0 or to 1. The distribution of a sum of k such betas would concentrate its mass close to the integers 0, 1, 2, ..., k unless k were quite large.


Table 1

True percentage points, Pearson curve approximations to these percentage points, and true percentiles for the Pearson curve values for four sum distributions.

                       $U_1+U_2$   $U_1+U_2+U_3$   $U_1+B_{2,2}$   $B_{2,2}+B'_{2,2}$

Kurtosis                2.4         2.6             2.4107          2.5714
Range                   [0,2]       [0,3]           [0,2]           [0,2]

True 0.25% point        .0707       .2466           .1390           .2112
Pearson value           .0348       .2318           .1331           .1990
True % for Pearson      .06%        .21%            .22%            .20%

True 0.5% point         .1000       .3107           .1763           .2536
Pearson value           .0789       .3077           .1737           .2512
True % for Pearson      .31%        .49%            .48%            .48%

True 1.0% point         .1414       .3915           .2242           .3052
Pearson value           .1342       .3966           .2241           .3056
True % for Pearson      .90%        1.04%           1.00%           1.00%

True 2.5% point         .2236       .5314           .3092           .3918
Pearson value           .2305       .5402           .3110           .3944
True % for Pearson      2.66%       2.63%           2.54%           2.56%

True 5.0% point         .3162       .6694           .3966           .4760
Pearson value           .3277       .6752           .3986           .4785
True % for Pearson      5.37%       5.13%           5.07%           5.09%

True 10.0% point        .4472       .8434           .5123           .5824
Pearson value           .4554       .8428           .5133           .5832
True % for Pearson      10.37%      9.98%           10.05%          10.04%

True 25.0% point        .7071       1.1471          .7338           .7761
Pearson value           .7003       1.1452          .732            .775
True % for Pearson      24.52%      24.88%          24.90%          24.85%


Table 2

True, computer simulation (10^[exponent illegible] rep.), and Pearson curve percentage points for a sum of two $\beta(1,3)$ r.v.'s.

Columns: Percent; True; Computer Simulation; Pearson Curve; True Percentiles of Pearson Curve Values.

Only the entries of the last column are legible:

True Percentiles of Pearson Curve Values

.022%
.19%
.67%
2.28%
4.98%
10.26%
25.34%
49.83%
74.78%
90.14%
95.14%
97.55%
98.98%
99.47%
99.72%



Table 3

Percentage points for a sum of 5 iid $\beta(1,3)$ random variables.

Percent    Computer Simulation    Pearson Curve


Table 4

Percentage points for a sum of 10 iid $\beta(1,3)$ random variables.

Percent    Computer Simulation    Pearson Curve    Edgeworth Expansion


3.  If  Two  or  More  Independent  Random  Variables  are  Best  Fitted  by 
One  Type  of  Pearson  Curve,  is  Their  Sum  Best  Fitted  by  a  Pearson 
Curve  of  the  Same  Type? 


The  answer  to  this  question  will  depend  on  the  type  of  Pearson 
curve  which  best  fits  the  summand  random  variables.  However,  before 
the  investigation  of  this  question  can  begin,  it  will  be  necessary  to 
establish  some  notation  and  to  make  some  background  remarks  concerning 
the  Pearson  curve  system. 

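Let $X_1$ and $X_2$ be independent random variables with finite fourth moments. Let $K_1$, $K_2$, $K_3$, and $K_4$ be the first four cumulants of $X_1$. Let $L_1$, $L_2$, $L_3$, and $L_4$ be the first four cumulants of $X_2$. The first four cumulants of $X_1 + X_2$ will be $K_1 + L_1$, $K_2 + L_2$, $K_3 + L_3$, and $K_4 + L_4$. Let $\sqrt{\beta_1'}$, $\sqrt{\beta_1''}$, and $\sqrt{\hat\beta_1}$ be the skewness values for $X_1$, $X_2$, and $X_1 + X_2$, respectively. Let $\beta_2'$, $\beta_2''$, and $\hat\beta_2$ be the kurtosis values for $X_1$, $X_2$, and $X_1 + X_2$, respectively. Recall that $\sqrt{\beta_1'}$ and $\beta_2'$ are defined by

$$\sqrt{\beta_1'} = \frac{K_3}{K_2^{3/2}}$$

and

$$\beta_2' = 3 + \frac{K_4}{K_2^2}.$$

The other skewness and kurtosis values are defined analogously. The symbols $\sqrt{\beta_1}$ and $\beta_2$ will be used as generic symbols for skewness and kurtosis.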
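Because cumulants of independent summands add, the skewness and kurtosis of a sum follow directly from the second, third, and fourth cumulants of the summands. A minimal sketch is given below; the function names are introduced here for illustration only, and the exponential example uses the standard cumulants (1, 2, 6) of a unit-rate exponential.

```python
def skewness_kurtosis(k2, k3, k4):
    """Skewness k3 / k2**1.5 and kurtosis 3 + k4 / k2**2 from cumulants."""
    return k3 / k2**1.5, 3.0 + k4 / k2**2


def shape_of_sum(cum1, cum2):
    """Skewness and kurtosis of X1 + X2 for independent X1 and X2.

    cum1 = (K2, K3, K4) and cum2 = (L2, L3, L4) are the second, third,
    and fourth cumulants of the two summands.
    """
    (k2, k3, k4), (l2, l3, l4) = cum1, cum2
    return skewness_kurtosis(k2 + l2, k3 + l3, k4 + l4)


# Example: two independent unit-rate exponentials.
EXPONENTIAL = (1.0, 2.0, 6.0)
skew_sum, kurt_sum = shape_of_sum(EXPONENTIAL, EXPONENTIAL)
```

In this example the sum is a gamma random variable with shape parameter 2, whose skewness is $\sqrt{2}$ and whose kurtosis is 6.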

A Pearson curve is uniquely determined by its first four moments. Thus, a natural way to fit a Pearson curve to a probability distribution is to find the Pearson curve whose first four moments match those of the distribution. In this discussion, the "best fitting" Pearson curve will be defined to be the one found in this way. However, other fitting methods are sometimes used. For example, Pearson curves are sometimes fitted to chi random variables so as to match the first three moments subject to the constraint that 0 be the left endpoint of the interval of support.

Up to location and scale, the Pearson curve which best fits a distribution is determined by the skewness $\sqrt{\beta_1}$ and the kurtosis $\beta_2$ of the distribution. Since the type of a Pearson curve is location and scale invariant, $\sqrt{\beta_1}$ and $\beta_2$ determine the type.

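The following formulas, taken from Johnson and Kotz (1970), show how to find Pearson curve type from $\sqrt{\beta_1}$ and $\beta_2$. Define $c_0$, $c_1$, $c_2$, and $\kappa$ by

$$c_0 = (4\beta_2 - 3\beta_1)(10\beta_2 - 12\beta_1 - 18)^{-1}\mu_2$$

$$c_1 = \sqrt{\beta_1}\,(\beta_2 + 3)(10\beta_2 - 12\beta_1 - 18)^{-1}\sqrt{\mu_2}$$

$$c_2 = (2\beta_2 - 3\beta_1 - 6)(10\beta_2 - 12\beta_1 - 18)^{-1}$$

$$\kappa = \tfrac{1}{4} c_1^2 (c_0 c_2)^{-1},$$

where $\mu_2$ denotes the variance.

Type I: $\kappa < 0$, which is equivalent to $2\beta_2 - 3\beta_1 - 6 < 0$.

Type II: $\beta_1 = 0$, $\beta_2 < 3$.

Type III: $2\beta_2 - 3\beta_1 - 6 = 0$.

Type IV: $0 < \kappa < 1$.

Type V: $\kappa = 1$.

Type VI: $\kappa > 1$.

Type VII: $\beta_1 = 0$, $\beta_2 > 3$.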
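The criteria above translate into a short classification routine. The sketch below is an illustration, not code from the report: the function name, the tolerance `tol` used to detect the boundary Types III and V, and the label "G" for the normal point (taken from the later discussion) are choices made here. It uses the fact that $\mu_2$ cancels in $\kappa$, so that $\kappa = \beta_1(\beta_2+3)^2 / [4(4\beta_2-3\beta_1)(2\beta_2-3\beta_1-6)]$.

```python
def pearson_type(beta1, beta2, tol=1e-12):
    """Classify the Pearson type from beta1 = skewness**2 and the kurtosis beta2."""
    if beta1 < 0.0 or beta2 < beta1 + 1.0:
        raise ValueError("no distribution has these (beta1, beta2) values")
    if beta1 <= tol:                      # symmetric cases
        if abs(beta2 - 3.0) <= tol:
            return "G"                    # normal distribution
        return "II" if beta2 < 3.0 else "VII"
    d = 2.0 * beta2 - 3.0 * beta1 - 6.0
    if abs(d) <= tol:
        return "III"
    if d < 0.0:
        return "I"                        # kappa < 0
    kappa = beta1 * (beta2 + 3.0) ** 2 / (4.0 * (4.0 * beta2 - 3.0 * beta1) * d)
    if abs(kappa - 1.0) <= tol:
        return "V"
    return "IV" if kappa < 1.0 else "VI"


examples = {
    "uniform": pearson_type(0.0, 1.8),
    "exponential": pearson_type(4.0, 9.0),
    "beta(1,3)": pearson_type(20.0 / 27.0, 3.0 + 2.0 / 21.0),
}
```

For instance, the uniform distribution is classified as Type II, the exponential as Type III, and the $\beta(1,3)$ distribution as Type I. In practice the boundary Types III and V are only detected when the inputs hit the boundary to within `tol`.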

The classification of $(\beta_1,\beta_2)$ pairs implied by these formulas is displayed graphically in Figures 1 and 2, which are taken from Rhind (1909). The "limit for all frequency distributions" line has been added to Rhind's version of Figure 1. The text of Rhind's paper indicates that existence of this limiting line was not known in 1909. The line labeled V in Figure 1 may look like it is not quite straight because of sloppiness on Rhind's part, but this is not the case. This curve is the solution to the cubic equation

$$\beta_1(\beta_2 + 3)^2 = 4(4\beta_2 - 3\beta_1)(2\beta_2 - 3\beta_1 - 6).$$

The curve is also shown on Figure 2, where it is more obvious that it is not straight. The line labeled III is straight, however.

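The kurtosis $\beta_2$ does not seem to be a convenient parameter for the purposes of this discussion. For this reason, let us define $\gamma'$, $\gamma''$, and $\hat\gamma$ by

$$\gamma' = \beta_2' - 3, \quad \gamma'' = \beta_2'' - 3, \quad \text{and} \quad \hat\gamma = \hat\beta_2 - 3.$$

Thus, the $\gamma$ parameters are related to the cumulants by

$$\gamma' = \frac{K_4}{K_2^2}, \quad \gamma'' = \frac{L_4}{L_2^2}, \quad \text{and} \quad \hat\gamma = \frac{K_4 + L_4}{(K_2 + L_2)^2}.$$

[Figure 1. This diagram, taken from Rhind (1909), shows how $\beta_1$ and $\beta_2$ determine Pearson curve type. "U-shaped" betas fall in the region marked U. "J-shaped" betas fall in the region marked J. Other betas fall in the region marked I. For all distributions, $(\beta_1,\beta_2)$ satisfies $\beta_2 - \beta_1 - 1 \ge 0$. Pearson curves for which $(\beta_1,\beta_2)$ falls below the "$\beta_8 = \infty$" line have an infinite 8th moment.]
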


One can think of this $\gamma$ parameter as being a normalized fourth cumulant in the same way that $\sqrt{\beta_1}$ is a normalized third cumulant.

The Pearson curve corresponding to a given distribution is of course specified, up to location and scale, by the values of $\sqrt{\beta_1}$ and $\gamma$ of the distribution. When one works in terms of $\beta_1$ and $\gamma$ instead of in terms of $\beta_1$ and $\beta_2$, Figure 1 is replaced by Figure 3.

Let us subdivide the region in the $(\beta_1,\gamma)$ plane which corresponds to Type I distributions into the regions $I^-$, $I^+$, and $I^0$. (See Figure 4.) $I^-$ is the part of the Type I region where $\gamma < 0$, $I^+$ is the part of the Type I region where $\gamma > 0$, and $I^0$ is the part of the Type I region where $\gamma = 0$. These subregions have no known significance with respect to the shapes of the Pearson curves they contain. Their importance arises solely from the question to be investigated.

If $\beta_1 = 0$, the Pearson curve type is determined by the sign of $\gamma$:

$\beta_1 = 0$, $\gamma < 0$ implies Type II.

$\beta_1 = 0$, $\gamma = 0$ implies Type G (normal distribution).

$\beta_1 = 0$, $\gamma > 0$ implies Type VII.


If $\beta_1 > 0$, the Pearson curve type is "almost" determined by the ratio $\gamma/\beta_1$:

$\gamma/\beta_1 < 0$ implies Type $I^-$.

$\gamma/\beta_1 = 0$ implies Type $I^0$.

$0 < \gamma/\beta_1 < 3/2$ implies Type $I^+$.

$\gamma/\beta_1 = 3/2$ implies Type III.

$3/2 < \gamma/\beta_1$ implies Type VI, Type V, or Type IV.

[Figure 4. This diagram, taken from Rhind (1909), shows how the Type I region in the $(\beta_1,\gamma)$ plane is divided into the regions $I^-$ and $I^+$ and the line $I^0$.]

Now if one restricts attention to that part of the $(\beta_1,\gamma)$ plane shown in Figure 4, there exists some small number $\epsilon > 0$ such that

$3/2 < \gamma/\beta_1 < 2-\epsilon$ implies Type VI (true even when $\beta_1 > 1.8$),

$2-\epsilon < \gamma/\beta_1 < 2+\epsilon$ is implied by Type V,

$2+\epsilon < \gamma/\beta_1$ implies Type IV.
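The rules stated in terms of $\beta_1$ and $\gamma$ can be sketched as a lookup on the ratio $\gamma/\beta_1$. This is an illustrative sketch only: the function name, the region labels "I-", "I0", and "I+", and the tolerance are assumptions made here, and the routine deliberately does not try to separate Types VI, V, and IV, which the ratio alone does not determine.

```python
def region_from_ratio(beta1, gamma, tol=1e-12):
    """Region of the (beta1, gamma) plane, using only the sign and ratio rules."""
    if beta1 < 0.0:
        raise ValueError("beta1 is a squared skewness and cannot be negative")
    if beta1 <= tol:                      # symmetric cases: sign of gamma decides
        if abs(gamma) <= tol:
            return "G"
        return "II" if gamma < 0.0 else "VII"
    ratio = gamma / beta1
    if abs(ratio) <= tol:
        return "I0"
    if ratio < 0.0:
        return "I-"
    if abs(ratio - 1.5) <= tol:
        return "III"
    if ratio < 1.5:
        return "I+"
    return "VI, V, or IV"


regions = [
    region_from_ratio(1.0, 1.0),          # a point used later in the text
    region_from_ratio(4.0, 6.0),          # exponential distribution
    region_from_ratio(0.0, -1.2),         # uniform distribution
]
```

Here the point (1, 1) falls in "I+", the exponential point (4, 6) in "III", and the uniform point (0, -1.2) in "II".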


This completes the necessary background remarks, so that we can finally proceed to the question of interest. To begin, let us consider what happens when $X_1$ and $X_2$ are iid, or, to restrict attention to what is relevant here, when $X_1$ and $X_2$ are such that $K_2 = L_2$, $K_3 = L_3$, and $K_4 = L_4$. In this case, we have

$$\hat\gamma = \frac{K_4 + L_4}{(K_2 + L_2)^2} = \frac{2K_4}{4K_2^2} = \frac{\gamma'}{2} = \frac{\gamma''}{2}$$

and

$$\hat\beta_1 = \frac{(K_3 + L_3)^2}{(K_2 + L_2)^3} = \frac{4K_3^2}{8K_2^3} = \frac{\beta_1'}{2} = \frac{\beta_1''}{2}.$$

If $\beta_1' = 0$, this implies $\hat\beta_1 = 0$ and sign($\hat\gamma$) = sign($\gamma'$). If $\beta_1' > 0$, this implies $\hat\gamma/\hat\beta_1 = \gamma'/\beta_1'$.


Thus, the Types II, G, and VII, which occur when $\beta_1 = 0$, are preserved under addition of two iid random variables. The same is true of Types $I^-$, $I^0$, $I^+$, and III, which are characterized by the value of the ratio $\gamma/\beta_1$, since this ratio is preserved under addition of two iid random variables. The Types VI, V, and IV are "almost" preserved in the same sense that they are "almost" determined by the ratio $\gamma/\beta_1$. Thus, Type VI random variables for which $3/2 < \gamma/\beta_1 < 2-\epsilon$ are preserved under addition in this sense. The same is true for Type IV random variables for which $2+\epsilon < \gamma/\beta_1$ and $0 < \beta_1 \le 1.8$. Type V random variables will almost never be preserved, but the sum of two iid Type V's for which $0 < \beta_1 \le 1.8$ will be very close to a Type V distribution with respect to its first four moments. The second derivative of the Type V curve is negative close to $\beta_1 = 0$ and is positive when $\beta_1$ is large, so there will be at least one point $(\beta_1,\gamma)$ on the curve for which $(\beta_1/2, \gamma/2)$ is also on the Type V curve.

If n iid random variables are added, the $\beta_1$ and $\gamma$ values for the sum are equal to $1/n$ times the corresponding values for the summands. It follows that all of the above results hold when n instead of just two iid random variables are added together. Let us record this formally as




Theorem 1. Suppose a random variable X is best fitted by a Pearson curve of Type I, II, III, or VII. If n iid copies of X are added together, the sum is best fitted by a Pearson curve of the same type. The same is "almost" true for Pearson curve Types IV, V, and VI in the sense described above.
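The $1/n$ scaling behind Theorem 1 can be checked numerically from the cumulants, since the cumulants of a sum of n iid copies are n times those of a single copy. The sketch below is illustrative; the function names are introduced here, and the example again uses the unit-rate exponential with cumulants (1, 2, 6).

```python
def beta1_gamma(k2, k3, k4):
    """Squared skewness k3**2 / k2**3 and normalized fourth cumulant k4 / k2**2."""
    return k3**2 / k2**3, k4 / k2**2


def beta1_gamma_of_iid_sum(k2, k3, k4, n):
    """(beta1, gamma) for a sum of n iid copies; each cumulant is multiplied by n."""
    return beta1_gamma(n * k2, n * k3, n * k4)


single = beta1_gamma(1.0, 2.0, 6.0)                    # unit-rate exponential
sum_of_five = beta1_gamma_of_iid_sum(1.0, 2.0, 6.0, 5)
ratio_single = single[1] / single[0]
ratio_sum = sum_of_five[1] / sum_of_five[0]
```

Both $\beta_1$ and $\gamma$ shrink by the factor 1/5, while the ratio $\gamma/\beta_1$ stays at 3/2, so the sum remains on the Type III line.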

When $X_1$ and $X_2$ are not iid, matters become more complicated. The relationship between the $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$ pairs of the summands and the $(\hat\beta_1,\hat\gamma)$ pair of the sum is not so easily described as in the iid case. The key result here will be Theorem 3, although Theorem 2 will be useful also.

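Theorem 2. $\hat\beta_1 \le \max\{\beta_1',\beta_1''\}$, and $|\hat\gamma| \le \max\{|\gamma'|,|\gamma''|\}$.

Proof. Suppose $\beta_1'' \le \beta_1'$. In terms of the cumulants, this means

$$\frac{L_3^2}{L_2^3} \le \frac{K_3^2}{K_2^3},$$

so that

$$|L_3| \le \left(\frac{L_2}{K_2}\right)^{3/2} |K_3|.$$

Thus,

$$\hat\beta_1 = \frac{(K_3 + L_3)^2}{(K_2 + L_2)^3} \le \frac{(|K_3| + |L_3|)^2}{(K_2 + L_2)^3} \le \frac{\left(|K_3| + (L_2/K_2)^{3/2}|K_3|\right)^2}{(K_2 + L_2)^3} = \frac{K_3^2}{K_2^3} \cdot \frac{(K_2^{3/2} + L_2^{3/2})^2}{(K_2 + L_2)^3} = \beta_1' \left[\frac{K_2^{3/2} + L_2^{3/2}}{(K_2 + L_2)^{3/2}}\right]^2.$$

Since $x^{3/2}$ is a convex increasing function of x for $x \ge 0$,

$$K_2^{3/2} + L_2^{3/2} \le (K_2 + L_2)^{3/2}.$$

This and the above imply

$$\hat\beta_1 \le \beta_1' = \max\{\beta_1',\beta_1''\}.$$

The proof of the second assertion is similar. ∎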
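The two inequalities of Theorem 2 are purely algebraic in the cumulants and require only positive second cumulants, so they can be spot-checked on randomly drawn values. The following sketch is a numerical sanity check, not a substitute for the proof; the function names, the sampling ranges, and the small relative slack for rounding are all choices made here, and the random triples need not be cumulants of any actual distribution.

```python
import random


def beta1_gamma(c2, c3, c4):
    """Squared skewness and normalized fourth cumulant from cumulants."""
    return c3**2 / c2**3, c4 / c2**2


def theorem2_holds(cum1, cum2, slack=1e-12):
    """Check both inequalities of Theorem 2 for cumulant triples (c2, c3, c4)."""
    b1, g1 = beta1_gamma(*cum1)
    b2, g2 = beta1_gamma(*cum2)
    b_hat, g_hat = beta1_gamma(*(a + b for a, b in zip(cum1, cum2)))
    ok_beta = b_hat <= max(b1, b2) * (1.0 + slack)
    ok_gamma = abs(g_hat) <= max(abs(g1), abs(g2)) * (1.0 + slack)
    return ok_beta and ok_gamma


def random_check(trials=10000, seed=1):
    """Return True if Theorem 2 holds on every randomly drawn pair of triples."""
    rng = random.Random(seed)

    def draw():
        return (rng.uniform(0.1, 5.0), rng.uniform(-5.0, 5.0), rng.uniform(-5.0, 5.0))

    return all(theorem2_holds(draw(), draw()) for _ in range(trials))


all_passed = random_check()
```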

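Theorem 3. If $\gamma'$ and $\gamma''$ have the same sign (positive, negative, or 0), then $\hat\gamma$ also has this sign, and

$$\left|\frac{\hat\beta_1}{\hat\gamma}\right| \le \max\left\{\left|\frac{\beta_1'}{\gamma'}\right|, \left|\frac{\beta_1''}{\gamma''}\right|\right\}.$$

Here, $|a/0|$ is to be interpreted as $\infty$ for every $a \in \mathbb{R}$. If the sign of $\gamma'$ and $\gamma''$ is not 0, then equality holds if and only if

$$\frac{K_3}{K_2} = \frac{L_3}{L_2} \quad \text{and} \quad \frac{K_4}{K_2} = \frac{L_4}{L_2}.$$
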
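Proof. The proof is trivial when sign($\gamma'$) = sign($\gamma''$) = 0. Suppose that $\gamma' > 0$ and $\gamma'' > 0$. This implies $\hat\gamma > 0$. Note that $\beta_1 = (\sqrt{\beta_1})^2$ is never negative. Thus, the assertion is equivalent to

$$\frac{\hat\beta_1}{\hat\gamma} \le \max\left\{\frac{\beta_1'}{\gamma'}, \frac{\beta_1''}{\gamma''}\right\}.$$

Let

$$c = \max\left\{\frac{\beta_1'}{\gamma'}, \frac{\beta_1''}{\gamma''}\right\}.$$

Then

(1) $\hat\beta_1/\hat\gamma \le c$

if and only if

$\hat\beta_1 \le c\hat\gamma$

if and only if

$\dfrac{(K_3+L_3)^2}{(K_2+L_2)^3} \le c\,\dfrac{K_4+L_4}{(K_2+L_2)^2}$

if and only if

$(K_3+L_3)^2 \le c(K_4+L_4)(K_2+L_2)$

if and only if

(2) $\left(K_2^{3/2}\sqrt{\beta_1'} + L_2^{3/2}\sqrt{\beta_1''}\right)^2 \le c\left(K_2^2\gamma' + L_2^2\gamma''\right)(K_2+L_2)$

if (since $c\gamma' \ge \beta_1'$ and $c\gamma'' \ge \beta_1''$)

(3) $\left(K_2^{3/2}\sqrt{\beta_1'} + L_2^{3/2}\sqrt{\beta_1''}\right)^2 \le \left(K_2^2\beta_1' + L_2^2\beta_1''\right)(K_2+L_2)$

if and only if

$K_2^3\beta_1' + L_2^3\beta_1'' + 2K_2^{3/2}L_2^{3/2}\sqrt{\beta_1'}\sqrt{\beta_1''} \le K_2^3\beta_1' + L_2^3\beta_1'' + L_2K_2^2\beta_1' + K_2L_2^2\beta_1''$

if and only if

$2K_2^{3/2}L_2^{3/2}\sqrt{\beta_1'}\sqrt{\beta_1''} \le L_2K_2^2\beta_1' + K_2L_2^2\beta_1''$

if and only if

$2K_2^{1/2}L_2^{1/2}\sqrt{\beta_1'}\sqrt{\beta_1''} \le K_2\beta_1' + L_2\beta_1''$

if and only if

(4) $0 \le \left(K_2^{1/2}\sqrt{\beta_1'} - L_2^{1/2}\sqrt{\beta_1''}\right)^2$.

Equation (4) is always true. Following the chain of implications back up shows equation (1) is always true.

If $K_2^{1/2}\sqrt{\beta_1'} \ne L_2^{1/2}\sqrt{\beta_1''}$, then we have strict inequality in (4). Strict inequality in (4) implies strict inequality in all the preceding steps. Note that $K_2^{1/2}\sqrt{\beta_1'} = L_2^{1/2}\sqrt{\beta_1''}$ is equivalent to $K_3/K_2 = L_3/L_2$. If $\beta_1'/\gamma' \ne \beta_1''/\gamma''$, then we get strict inequality in (2) when we go up from (3) to (2). Strict inequality in (2) implies strict inequality in (1). Note that $K_4/K_2 = L_4/L_2$ is equivalent to $\beta_1'/\gamma' = \beta_1''/\gamma''$ when $K_3/K_2 = L_3/L_2$ holds. This shows the "only if" part of the last assertion.

If $K_3/K_2 = L_3/L_2$ and $K_4/K_2 = L_4/L_2$ both hold, then $K_2^{1/2}\sqrt{\beta_1'} = L_2^{1/2}\sqrt{\beta_1''}$ and $\beta_1'/\gamma' = \beta_1''/\gamma''$. This implies equality in (4) and in all the preceding steps. This finishes the case $\gamma' > 0$ and $\gamma'' > 0$. The proof for the case $\gamma' < 0$ and $\gamma'' < 0$ is similar. ∎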

It follows from Theorem 3 that if independent random variables are best fitted by Type II Pearson curves, then their sum is also best fitted by a Type II Pearson curve. The same holds for Type VII, for the union of Type $I^0$ and Type G, and for the union of Type $I^-$ and Type II. These results would have been trivial to prove directly, however. It is on the types for which $\gamma > 0$ that Theorem 3 sheds the most light. For example, if $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$ are both in region $I^+$, then $(\hat\beta_1,\hat\gamma)$ may fall only in $I^+$, III, VI, V, IV, and VII. However, if $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$ fall in the Type VI region, then $(\hat\beta_1,\hat\gamma)$ must be in VI, V, IV, or VII. By using both Theorem 2 and Theorem 3, one can conclude that $(\hat\beta_1,\hat\gamma)$ will be in either IV or VII when $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$ are in that part of the IV region for which $0 < \beta_1 \le 1.8$ and $2+\epsilon < \gamma/\beta_1$.

The most interesting application of Theorem 3 is to Type III random variables. Suppose that $X_1$ and $X_2$ both have gamma distributions with support on $[0,\infty)$. Then the densities of $X_1$ and $X_2$ are Type III Pearson curves, so that $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$ fall on the line $\gamma/\beta_1 = 3/2$. By the first part of Theorem 3, $(\hat\beta_1,\hat\gamma)$ must fall in III, VI, V, IV, or VII. However, the fact that $X_1$ and $X_2$ are gamma distributions with right tails implies $\sqrt{\beta_1'} > 0$ and $\sqrt{\beta_1''} > 0$. This in turn implies $\sqrt{\hat\beta_1} > 0$, so that $(\hat\beta_1,\hat\gamma)$ will not fall in VII. Now we can apply the condition for equality in Theorem 3. When $X_1$ and $X_2$ are both gamma random variables, the condition for equality in Theorem 3 is equivalent to the condition that the scale parameters of $X_1$ and $X_2$ be the same. Two gamma random variables with the same scale parameter are, at least in a limiting sense, sums of iid copies of the same random variable. (Recall that a gamma random variable with shape parameter k and scale parameter $\lambda$ can be thought of as a sum of k independent exponential random variables with parameter $\lambda$. This interpretation is useful even when k is not an integer.) Thus, we are essentially back in the case covered by Theorem 1 when the equality condition of Theorem 3 holds for gamma random variables. On the other hand, Theorem 3 implies that $X_1 + X_2$ will have the first four moments of a Pearson curve of Types VI, V, or IV when $X_1$ and $X_2$ have different scale parameters. Thus, the sum of two gamma random variables with different scale parameters cannot have the first four moments of a gamma random variable.
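The gamma case can be illustrated numerically. The sketch below assumes the multiplicative-scale parameterization, under which the r-th cumulant of a gamma random variable with shape k and scale $\lambda$ is $k\,(r-1)!\,\lambda^r$; the function names and the particular parameter values are chosen here for illustration.

```python
def gamma_cumulants(shape, scale):
    """Second, third, and fourth cumulants of a gamma random variable.

    Assumes `scale` multiplies the variable, so the r-th cumulant is
    shape * (r - 1)! * scale**r.
    """
    return shape * scale**2, 2.0 * shape * scale**3, 6.0 * shape * scale**4


def ratio_for_gamma_sum(shape1, scale1, shape2, scale2):
    """gamma_hat / beta1_hat for the sum of two independent gammas."""
    k2, k3, k4 = gamma_cumulants(shape1, scale1)
    l2, l3, l4 = gamma_cumulants(shape2, scale2)
    beta1_hat = (k3 + l3) ** 2 / (k2 + l2) ** 3
    gamma_hat = (k4 + l4) / (k2 + l2) ** 2
    return gamma_hat / beta1_hat


same_scale = ratio_for_gamma_sum(2.0, 1.5, 3.0, 1.5)
different_scale = ratio_for_gamma_sum(1.0, 1.0, 1.0, 2.0)
```

With equal scale parameters the ratio stays at 3/2 (Type III), while with scale parameters 1 and 2 it exceeds 3/2, so the sum no longer has the first four moments of a gamma random variable.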


It may also be enlightening to look more closely at what can happen when $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$ are in $I^+$. By Figure 4, it is possible for $X_1$ to have a beta distribution such that $(\beta_1',\gamma') = (1,1)$. Suppose this holds if $X_1 \sim \beta(p,q)$, and that $\sqrt{\beta_1'} = 1$. Suppose further that $X_2 \sim \beta(q,p)$. Then $(\beta_1'',\gamma'') = (1,1)$, but $\sqrt{\beta_1''} = -1$. Note also that, modulo a location shift, $X_2$ will have the same distribution as $-X_1$. In this case, we will have $(\hat\gamma,\hat\beta_1) = (\tfrac{1}{2}\gamma', 0) \in$ VII. Thus, the sum of two beta random variables can have the same first four moments as a t distribution. This will be the case whenever $(\beta_1',\gamma') \in I^+$ and $X_2$ has the same distribution as $-X_1$.

Now suppose that $(\beta_1',\gamma') = (1,1)$, and that $(\beta_1'',\gamma'') = (0,0)$. Thus, $X_2$ will have the same first four moments as a normal distribution. Calculation of $\hat\beta_1$ and $\hat\gamma$ yields

$$\hat\beta_1 = \frac{(K_3+L_3)^2}{(K_2+L_2)^3} = \frac{K_3^2}{(K_2+L_2)^3}$$

and

$$\hat\gamma = \frac{K_4+L_4}{(K_2+L_2)^2} = \frac{K_4}{(K_2+L_2)^2},$$

since $L_3 = L_4 = 0$. Since $\beta_1' = K_3^2/K_2^3$ and $\gamma' = K_4/K_2^2$, we have

$$\frac{\hat\gamma}{\hat\beta_1} = \frac{\gamma'}{\beta_1'} \cdot \frac{K_2+L_2}{K_2} = \frac{K_2+L_2}{K_2}.$$

Now $K_2$ and $L_2$ can be chosen independently of $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$. This is true because $K_2$ is just the variance of $X_1$, so that $K_2$ can be varied by scale transformations which leave $(\beta_1',\gamma')$ unchanged. The same holds for $L_2$ of course. Thus, by properly choosing $K_2$ and $L_2$, $\hat\gamma/\hat\beta_1 = (K_2+L_2)/K_2$ can be made equal to any given number in $(1,\infty)$. This result and Theorem 2 imply that $(\hat\gamma,\hat\beta_1)$ can be made to fall into any of $I^+$, III, VI, V, and IV in this case. Since $\hat\beta_1$ and $\hat\gamma$ are continuous functions of the cumulants of $X_1$ and $X_2$, it is not hard to see that $(\hat\beta_1,\hat\gamma)$ can fall into any of $I^+$, III, VI, V, and IV even when $(\beta_1',\gamma')$ and $(\beta_1'',\gamma'')$ are in $I^+$ and $\sqrt{\beta_1''}$ has the same sign as $\sqrt{\beta_1'}$.

Interest has been expressed in the fitting of Pearson curves to sums of independent chi random variables. Results contained in Elandt (1961) are helpful here. The Elandt paper gives formulas for the moments of noncentral chi random variables. It also contains a diagram (Figure 1, p. 555) showing how the $(\beta_1,\beta_2)$ pair for a noncentral chi moves through the $(\beta_1,\beta_2)$ plane as the noncentrality parameter changes. Comparison of this diagram with Figure 1 of this paper shows that a chi random variable is always best fitted by a Type I Pearson curve. By Theorem 1, any sum of finitely many iid chi random variables is also best fitted by a Type I Pearson curve. The question of whether this is true for nonidentically distributed summands now arises. The following shows that the best fitting Pearson curve for the sum of a central chi random variable and an independent noncentral chi random variable can be of Type I, III, IV, V, or VI.

Let $X_1$ be a central chi random variable arising from taking the absolute value of a N(0,1). Let $X_2$ be a N(0,1) random variable independent of $X_1$. It will now be shown that $X_1 + X_2$ has the first four moments of a Type IV Pearson curve.


It is easy to find the first four cumulants $K_1$, $K_2$, $K_3$, and $K_4$ of $X_1$ from the first row of Table 1 in Elandt (1961). The calculations imply

$K_1 = 0.7979$

$K_2 = 0.3634$

$K_3 = 0.21804$

$K_4 = 0.11473$.

The first four cumulants of $X_2$ are of course

$L_1 = 0$, $L_2 = 1.0$, $L_3 = 0$, $L_4 = 0$.

If we again use $\hat\beta_1$ and $\hat\beta_2$ for the (skewness)$^2$ and kurtosis of $X_1 + X_2$, we get

$$\hat\beta_1 = \frac{(K_3+L_3)^2}{(K_2+L_2)^3} = \frac{(0.21804)^2}{(1.3634)^3} = 0.01876$$

$$\hat\beta_2 = 3 + \frac{K_4+L_4}{(K_2+L_2)^2} = 3 + \frac{0.11473}{(1.3634)^2} = 3.06172.$$

Also, $\hat\gamma = \hat\beta_2 - 3 = 0.06172$. Thus, $\hat\gamma/\hat\beta_1 = 3.290$. It is easy to see from Figure 3 that $(\hat\beta_1,\hat\gamma) \in$ IV.
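The calculation above can be reproduced from the exact raw moments of $|N(0,1)|$ instead of tabulated values. The sketch below is a cross-check under that assumption, not the author's computation; the function and variable names are introduced here. Because it uses exact moments, the later digits differ slightly from the rounded figures quoted above.

```python
import math


def half_normal_cumulants():
    """First four cumulants of |Z| for Z ~ N(0,1), from its raw moments."""
    m1 = math.sqrt(2.0 / math.pi)     # E|Z|
    m2 = 1.0                          # E|Z|^2
    m3 = 2.0 * m1                     # E|Z|^3
    m4 = 3.0                          # E|Z|^4
    k2 = m2 - m1**2
    k3 = m3 - 3.0 * m1 * m2 + 2.0 * m1**3
    mu4 = m4 - 4.0 * m1 * m3 + 6.0 * m1**2 * m2 - 3.0 * m1**4
    k4 = mu4 - 3.0 * k2**2
    return m1, k2, k3, k4


K1, K2, K3, K4 = half_normal_cumulants()
L2, L3, L4 = 1.0, 0.0, 0.0            # cumulants of the independent N(0,1) summand

beta1_hat = (K3 + L3) ** 2 / (K2 + L2) ** 3
gamma_hat = (K4 + L4) / (K2 + L2) ** 2
ratio = gamma_hat / beta1_hat
```

The resulting ratio is about 3.29, well above 2, which is consistent with the sum falling in the Type IV region.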

Since the second, third, and fourth cumulants of a chi arising from $N(\mu,1)$ approach those of a normal N(0,1) as $\mu \to \infty$, the continuity of $\hat\beta_1$ and $\hat\gamma$ as functions of the cumulants implies that the sum of the central chi $|N(0,1)|$ and the noncentral chi $|N(\mu,1)|$ will have $(\hat\beta_1,\hat\gamma)$ in IV for $\mu$ sufficiently large. As $\mu$ varies from 0 to $\infty$, the $(\hat\beta_1,\hat\gamma)$ pair will trace out a continuous curve in the $(\beta_1,\gamma)$ plane which starts in the Type I region and ends in the Type IV region. By continuity and the fact that $\hat\beta_1$ is positive everywhere along this curve, the curve must pass through the regions for Types I, III, VI, V, and IV. Calculations using moments for $|N(3,1)|$ obtained from the last row of Table 1 in Elandt (1961) show that $(\hat\beta_1,\hat\gamma)$ is still in $I^+$ when $\mu = 3$.


References 


Elandt, Regina C. (1961). The folded normal distribution: Two methods of estimating parameters from moments. Technometrics, 3, 551-562.

Johnson, N.L. and Kotz, S. (1970). Continuous Univariate Distributions - 1. New York: Houghton Mifflin.

Rhind, A. (1909). Tables to facilitate the computation of probable errors of the chief constants of skew frequency distributions. Biometrika, 7, 127-147.




KEY WORDS

Beta  distribution,  Pearson  curves,  sums  of  independent 
random  variables. 




ABSTRACT 


It  is  shown  that  the  distribution  of  a  sum  of  independent  beta 
random  variables  is  often  well  approximated  by  a  properly  scaled  beta 
distribution.  The  relationship  between  the  type  of  Pearson  curve 
which  best  fits  a  sum  of  independent  random  variables  and  the  types 
of  the  Pearson  curves  which  best  fit  the  summand  random  variables 
is  also  investigated.  The  best  fitting  Pearson  curve  for  a  distri¬ 
bution  is  defined  here  to  be  the  unique  Pearson  curve  with  the  same 
first  four  moments. 




